Context-dependent outcome encoding in human reinforcement learning

نویسندگان

چکیده

A wealth of evidence in perceptual and economic decision-making research suggests that the subjective assessment one option is influenced by context. series studies provides same coding principles apply to situations where decisions are shaped past outcomes, is, reinforcement-learning situations. In bandit tasks, human behavior explained models assuming individuals do not learn objective value an outcome, but rather its subjective, context-dependent representation. We argue that, while such outcome context-dependence may be informationally or ecologically optimal, it concomitantly undermines capacity generalize value-based knowledge new contexts — sometimes creating apparent decision paradoxes.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Context-outcome associations underlie context-switch effects after partial reinforcement in human predictive learning

Predictive value for continuously reinforced cues is affected by context changes when they are trained within a context in which a different cue undergoes partial reinforcement. An experiment was conducted with the goal of exploring the mechanisms underlying this context-switch effect. Human participants were trained in a predictive learning situation in which a cue received partial reinforceme...

متن کامل

Context-dependent stopper encoding

Abstra t. A hara ter-based en oding method is presented for naturallanguage texts and geneti data. Exa t string mat hing from the en oded text is faster than from the original text, with medium and longer patterns. A ompression ratio of about 50% is a hieved as a by-produ t. The method en odes hara ters with variable-length odewords of 2-bit base symbols. An advan ed variant is ontext-dependent...

متن کامل

Reinforcement Learning through Neural Encoding

Recent progress in the field of Reinforcement Learning (RL) has enabled to tackle bigger and more challenging tasks. However, the increasing complexity of the problems, as well as the use of more sophisticated models such as Deep Neural Networks (DNN), has impeded the ability to understand the behavior of trained policies. In this work, we present the Semi-Aggregated Markov Decision Process (SA...

متن کامل

Context Tree Maximizing Reinforcement Learning

• Stochastic search approach [Nguyen et al 2011]: Parallel Tempering is utilized to find a good map based on the ΦMDP cost function. However, this approach is costly and does not guarantee finding the optimal state set or even a good one given a history •We propose an analytical and linear-time solution to this problem based on Context Tree Maximizing ∗Research School of Computer Science, Colle...

متن کامل

Linear Feature Encoding for Reinforcement Learning

Feature construction is of vital importance in reinforcement learning, as the quality of a value function or policy is largely determined by the corresponding features. The recent successes of deep reinforcement learning (RL) only increase the importance of understanding feature construction. Typical deep RL approaches use a linear output layer, which means that deep RL can be interpreted as a ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Current opinion in behavioral sciences

سال: 2021

ISSN: ['2352-1554', '2352-1546']

DOI: https://doi.org/10.1016/j.cobeha.2021.06.006